Reinforcement learning from human feedback

  • 2025-04-18 (modified: 2025-06-05)
  • 별칭: RLHF

인간의 피드백을 통한 강화학습.

Articles

2025 © ak